Overview

Dataset statistics

Number of variables50
Number of observations101766
Missing cells181168
Missing cells (%)3.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory38.8 MiB
Average record size in memory400.0 B

Variable types

Numeric13
Categorical30
Text4
Boolean3

Alerts

examide has constant value "False"Constant
citoglipton has constant value "False"Constant
A1Cresult is highly overall correlated with acetohexamide and 3 other fieldsHigh correlation
acetohexamide is highly overall correlated with A1Cresult and 1 other fieldsHigh correlation
change is highly overall correlated with diabetesMed and 1 other fieldsHigh correlation
diabetesMed is highly overall correlated with change and 1 other fieldsHigh correlation
encounter_id is highly overall correlated with patient_nbrHigh correlation
glimepiride-pioglitazone is highly overall correlated with A1Cresult and 1 other fieldsHigh correlation
glipizide-metformin is highly overall correlated with max_glu_serumHigh correlation
insulin is highly overall correlated with change and 1 other fieldsHigh correlation
max_glu_serum is highly overall correlated with acetohexamide and 7 other fieldsHigh correlation
metformin-pioglitazone is highly overall correlated with A1Cresult and 1 other fieldsHigh correlation
metformin-rosiglitazone is highly overall correlated with max_glu_serumHigh correlation
miglitol is highly overall correlated with max_glu_serumHigh correlation
patient_nbr is highly overall correlated with encounter_idHigh correlation
troglitazone is highly overall correlated with A1Cresult and 1 other fieldsHigh correlation
weight is highly overall correlated with max_glu_serumHigh correlation
race is highly imbalanced (55.9%)Imbalance
weight is highly imbalanced (92.0%)Imbalance
metformin is highly imbalanced (59.5%)Imbalance
repaglinide is highly imbalanced (93.9%)Imbalance
nateglinide is highly imbalanced (96.9%)Imbalance
chlorpropamide is highly imbalanced (99.5%)Imbalance
glimepiride is highly imbalanced (84.0%)Imbalance
acetohexamide is highly imbalanced (> 99.9%)Imbalance
glipizide is highly imbalanced (69.2%)Imbalance
glyburide is highly imbalanced (72.3%)Imbalance
tolbutamide is highly imbalanced (99.7%)Imbalance
pioglitazone is highly imbalanced (80.2%)Imbalance
rosiglitazone is highly imbalanced (82.2%)Imbalance
acarbose is highly imbalanced (98.5%)Imbalance
miglitol is highly imbalanced (99.7%)Imbalance
troglitazone is highly imbalanced (> 99.9%)Imbalance
tolazamide is highly imbalanced (99.7%)Imbalance
glyburide-metformin is highly imbalanced (97.0%)Imbalance
glipizide-metformin is highly imbalanced (99.8%)Imbalance
glimepiride-pioglitazone is highly imbalanced (> 99.9%)Imbalance
metformin-rosiglitazone is highly imbalanced (> 99.9%)Imbalance
metformin-pioglitazone is highly imbalanced (> 99.9%)Imbalance
max_glu_serum has 96420 (94.7%) missing valuesMissing
A1Cresult has 84748 (83.3%) missing valuesMissing
number_emergency is highly skewed (γ1 = 22.85558215)Skewed
encounter_id has unique valuesUnique
num_procedures has 46652 (45.8%) zerosZeros
number_outpatient has 85027 (83.6%) zerosZeros
number_emergency has 90383 (88.8%) zerosZeros
number_inpatient has 67630 (66.5%) zerosZeros

Reproduction

Analysis started2025-11-30 11:33:06.677376
Analysis finished2025-11-30 11:34:38.628700
Duration1 minute and 31.95 seconds
Software versionydata-profiling vv4.18.0
Download configurationconfig.json

Variables

encounter_id
Real number (ℝ)

High correlation  Unique 

Distinct101766
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.6520165 × 108
Minimum12522
Maximum4.4386722 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:38.872471image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum12522
5-th percentile27170784
Q184961194
median1.5238899 × 108
Q32.3027089 × 108
95-th percentile3.7896284 × 108
Maximum4.4386722 × 108
Range4.438547 × 108
Interquartile range (IQR)1.4530969 × 108

Descriptive statistics

Standard deviation1.026403 × 108
Coefficient of variation (CV)0.62130311
Kurtosis-0.10207139
Mean1.6520165 × 108
Median Absolute Deviation (MAD)70921143
Skewness0.69914155
Sum1.6811911 × 1013
Variance1.053503 × 1016
MonotonicityNot monotonic
2025-11-30T12:34:39.206470image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4438672221
 
< 0.1%
22783921
 
< 0.1%
1491901
 
< 0.1%
644101
 
< 0.1%
5003641
 
< 0.1%
166801
 
< 0.1%
4438160241
 
< 0.1%
4438115361
 
< 0.1%
4438045701
 
< 0.1%
4437972981
 
< 0.1%
Other values (101756)101756
> 99.9%
ValueCountFrequency (%)
125221
< 0.1%
157381
< 0.1%
166801
< 0.1%
282361
< 0.1%
357541
< 0.1%
369001
< 0.1%
409261
< 0.1%
425701
< 0.1%
558421
< 0.1%
622561
< 0.1%
ValueCountFrequency (%)
4438672221
< 0.1%
4438571661
< 0.1%
4438541481
< 0.1%
4438477821
< 0.1%
4438475481
< 0.1%
4438471761
< 0.1%
4438427781
< 0.1%
4438423401
< 0.1%
4438421361
< 0.1%
4438420701
< 0.1%

patient_nbr
Real number (ℝ)

High correlation 

Distinct71518
Distinct (%)70.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54330401
Minimum135
Maximum1.8950262 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:39.463652image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum135
5-th percentile1456971.8
Q123413221
median45505143
Q387545950
95-th percentile1.1148027 × 108
Maximum1.8950262 × 108
Range1.8950248 × 108
Interquartile range (IQR)64132729

Descriptive statistics

Standard deviation38696359
Coefficient of variation (CV)0.71224138
Kurtosis-0.34737204
Mean54330401
Median Absolute Deviation (MAD)32950134
Skewness0.47128072
Sum5.5289876 × 1012
Variance1.4974082 × 1015
MonotonicityNot monotonic
2025-11-30T12:34:39.737839image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8878589140
 
< 0.1%
4314090628
 
< 0.1%
8822754023
 
< 0.1%
166029323
 
< 0.1%
2319902123
 
< 0.1%
8442861322
 
< 0.1%
2364340522
 
< 0.1%
9270935121
 
< 0.1%
3709686620
 
< 0.1%
9060980420
 
< 0.1%
Other values (71508)101524
99.8%
ValueCountFrequency (%)
1352
 
< 0.1%
3781
 
< 0.1%
7291
 
< 0.1%
7741
 
< 0.1%
9271
 
< 0.1%
11525
< 0.1%
13051
 
< 0.1%
13143
< 0.1%
16291
 
< 0.1%
20251
 
< 0.1%
ValueCountFrequency (%)
1895026191
< 0.1%
1894814781
< 0.1%
1894451271
< 0.1%
1893658641
< 0.1%
1893510951
< 0.1%
1893494301
< 0.1%
1893320871
< 0.1%
1892988771
< 0.1%
1892578462
< 0.1%
1892157621
< 0.1%

race
Categorical

Imbalance 

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
Caucasian
76099 
AfricanAmerican
19210 
?
 
2273
Hispanic
 
2037
Other
 
1506

Length

Max length15
Median length9
Mean length9.8495077
Min length1

Characters and Unicode

Total characters1002345
Distinct characters18
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCaucasian
2nd rowCaucasian
3rd rowAfricanAmerican
4th rowCaucasian
5th rowCaucasian

Common Values

ValueCountFrequency (%)
Caucasian76099
74.8%
AfricanAmerican19210
 
18.9%
?2273
 
2.2%
Hispanic2037
 
2.0%
Other1506
 
1.5%
Asian641
 
0.6%

Length

2025-11-30T12:34:39.987739image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:34:40.158409image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
caucasian76099
74.8%
africanamerican19210
 
18.9%
2273
 
2.2%
hispanic2037
 
2.0%
other1506
 
1.5%
asian641
 
0.6%

Most occurring characters

ValueCountFrequency (%)
a269395
26.9%
i119234
11.9%
n117197
11.7%
c116556
11.6%
s78777
 
7.9%
C76099
 
7.6%
u76099
 
7.6%
r39926
 
4.0%
A39061
 
3.9%
e20716
 
2.1%
Other values (8)49285
 
4.9%

Most occurring categories

ValueCountFrequency (%)
(unknown)1002345
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a269395
26.9%
i119234
11.9%
n117197
11.7%
c116556
11.6%
s78777
 
7.9%
C76099
 
7.6%
u76099
 
7.6%
r39926
 
4.0%
A39061
 
3.9%
e20716
 
2.1%
Other values (8)49285
 
4.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown)1002345
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a269395
26.9%
i119234
11.9%
n117197
11.7%
c116556
11.6%
s78777
 
7.9%
C76099
 
7.6%
u76099
 
7.6%
r39926
 
4.0%
A39061
 
3.9%
e20716
 
2.1%
Other values (8)49285
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown)1002345
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a269395
26.9%
i119234
11.9%
n117197
11.7%
c116556
11.6%
s78777
 
7.9%
C76099
 
7.6%
u76099
 
7.6%
r39926
 
4.0%
A39061
 
3.9%
e20716
 
2.1%
Other values (8)49285
 
4.9%

gender
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
Female
54708 
Male
47055 
Unknown/Invalid
 
3

Length

Max length15
Median length6
Mean length5.0754967
Min length4

Characters and Unicode

Total characters516513
Distinct characters16
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowFemale
2nd rowFemale
3rd rowFemale
4th rowMale
5th rowMale

Common Values

ValueCountFrequency (%)
Female54708
53.8%
Male47055
46.2%
Unknown/Invalid3
 
< 0.1%

Length

2025-11-30T12:34:40.366706image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:34:40.498996image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
female54708
53.8%
male47055
46.2%
unknown/invalid3
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
e156471
30.3%
a101766
19.7%
l101766
19.7%
F54708
 
10.6%
m54708
 
10.6%
M47055
 
9.1%
n12
 
< 0.1%
U3
 
< 0.1%
k3
 
< 0.1%
o3
 
< 0.1%
Other values (6)18
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)516513
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e156471
30.3%
a101766
19.7%
l101766
19.7%
F54708
 
10.6%
m54708
 
10.6%
M47055
 
9.1%
n12
 
< 0.1%
U3
 
< 0.1%
k3
 
< 0.1%
o3
 
< 0.1%
Other values (6)18
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)516513
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e156471
30.3%
a101766
19.7%
l101766
19.7%
F54708
 
10.6%
m54708
 
10.6%
M47055
 
9.1%
n12
 
< 0.1%
U3
 
< 0.1%
k3
 
< 0.1%
o3
 
< 0.1%
Other values (6)18
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)516513
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e156471
30.3%
a101766
19.7%
l101766
19.7%
F54708
 
10.6%
m54708
 
10.6%
M47055
 
9.1%
n12
 
< 0.1%
U3
 
< 0.1%
k3
 
< 0.1%
o3
 
< 0.1%
Other values (6)18
 
< 0.1%

age
Categorical

Distinct10
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
[70-80)
26068 
[60-70)
22483 
[50-60)
17256 
[80-90)
17197 
[40-50)
9685 
Other values (5)
9077 

Length

Max length8
Median length7
Mean length7.0258633
Min length6

Characters and Unicode

Total characters714994
Distinct characters13
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row[0-10)
2nd row[10-20)
3rd row[20-30)
4th row[30-40)
5th row[40-50)

Common Values

ValueCountFrequency (%)
[70-80)26068
25.6%
[60-70)22483
22.1%
[50-60)17256
17.0%
[80-90)17197
16.9%
[40-50)9685
 
9.5%
[30-40)3775
 
3.7%
[90-100)2793
 
2.7%
[20-30)1657
 
1.6%
[10-20)691
 
0.7%
[0-10)161
 
0.2%

Length

2025-11-30T12:34:40.684570image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:34:41.052004image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
70-8026068
25.6%
60-7022483
22.1%
50-6017256
17.0%
80-9017197
16.9%
40-509685
 
9.5%
30-403775
 
3.7%
90-1002793
 
2.7%
20-301657
 
1.6%
10-20691
 
0.7%
0-10161
 
0.2%

Most occurring characters

ValueCountFrequency (%)
0206325
28.9%
[101766
14.2%
-101766
14.2%
)101766
14.2%
748551
 
6.8%
843265
 
6.1%
639739
 
5.6%
526941
 
3.8%
919990
 
2.8%
413460
 
1.9%
Other values (3)11425
 
1.6%

Most occurring categories

ValueCountFrequency (%)
(unknown)714994
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0206325
28.9%
[101766
14.2%
-101766
14.2%
)101766
14.2%
748551
 
6.8%
843265
 
6.1%
639739
 
5.6%
526941
 
3.8%
919990
 
2.8%
413460
 
1.9%
Other values (3)11425
 
1.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown)714994
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0206325
28.9%
[101766
14.2%
-101766
14.2%
)101766
14.2%
748551
 
6.8%
843265
 
6.1%
639739
 
5.6%
526941
 
3.8%
919990
 
2.8%
413460
 
1.9%
Other values (3)11425
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown)714994
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0206325
28.9%
[101766
14.2%
-101766
14.2%
)101766
14.2%
748551
 
6.8%
843265
 
6.1%
639739
 
5.6%
526941
 
3.8%
919990
 
2.8%
413460
 
1.9%
Other values (3)11425
 
1.6%

weight
Categorical

High correlation  Imbalance 

Distinct10
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
?
98569 
[75-100)
 
1336
[50-75)
 
897
[100-125)
 
625
[125-150)
 
145
Other values (5)
 
194

Length

Max length9
Median length1
Mean length1.2170961
Min length1

Characters and Unicode

Total characters123859
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row?
2nd row?
3rd row?
4th row?
5th row?

Common Values

ValueCountFrequency (%)
?98569
96.9%
[75-100)1336
 
1.3%
[50-75)897
 
0.9%
[100-125)625
 
0.6%
[125-150)145
 
0.1%
[25-50)97
 
0.1%
[0-25)48
 
< 0.1%
[150-175)35
 
< 0.1%
[175-200)11
 
< 0.1%
>2003
 
< 0.1%

Length

2025-11-30T12:34:41.450755image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:34:41.710946image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
98569
96.9%
75-1001336
 
1.3%
50-75897
 
0.9%
100-125625
 
0.6%
125-150145
 
0.1%
25-5097
 
0.1%
0-2548
 
< 0.1%
150-17535
 
< 0.1%
175-20011
 
< 0.1%
2003
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
?98569
79.6%
05172
 
4.2%
54368
 
3.5%
[3194
 
2.6%
)3194
 
2.6%
-3194
 
2.6%
12957
 
2.4%
72279
 
1.8%
2929
 
0.8%
>3
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)123859
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
?98569
79.6%
05172
 
4.2%
54368
 
3.5%
[3194
 
2.6%
)3194
 
2.6%
-3194
 
2.6%
12957
 
2.4%
72279
 
1.8%
2929
 
0.8%
>3
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)123859
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
?98569
79.6%
05172
 
4.2%
54368
 
3.5%
[3194
 
2.6%
)3194
 
2.6%
-3194
 
2.6%
12957
 
2.4%
72279
 
1.8%
2929
 
0.8%
>3
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)123859
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
?98569
79.6%
05172
 
4.2%
54368
 
3.5%
[3194
 
2.6%
)3194
 
2.6%
-3194
 
2.6%
12957
 
2.4%
72279
 
1.8%
2929
 
0.8%
>3
 
< 0.1%

admission_type_id
Real number (ℝ)

Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0240061
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:41.923820image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q33
95-th percentile6
Maximum8
Range7
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.4454028
Coefficient of variation (CV)0.7141297
Kurtosis1.9424761
Mean2.0240061
Median Absolute Deviation (MAD)0
Skewness1.5919843
Sum205975
Variance2.0891893
MonotonicityNot monotonic
2025-11-30T12:34:42.079887image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
153990
53.1%
318869
 
18.5%
218480
 
18.2%
65291
 
5.2%
54785
 
4.7%
8320
 
0.3%
721
 
< 0.1%
410
 
< 0.1%
ValueCountFrequency (%)
153990
53.1%
218480
 
18.2%
318869
 
18.5%
410
 
< 0.1%
54785
 
4.7%
65291
 
5.2%
721
 
< 0.1%
8320
 
0.3%
ValueCountFrequency (%)
8320
 
0.3%
721
 
< 0.1%
65291
 
5.2%
54785
 
4.7%
410
 
< 0.1%
318869
 
18.5%
218480
 
18.2%
153990
53.1%

discharge_disposition_id
Real number (ℝ)

Distinct26
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.7156418
Minimum1
Maximum28
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:42.287215image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q34
95-th percentile18
Maximum28
Range27
Interquartile range (IQR)3

Descriptive statistics

Standard deviation5.2801655
Coefficient of variation (CV)1.4210642
Kurtosis6.0033468
Mean3.7156418
Median Absolute Deviation (MAD)0
Skewness2.563067
Sum378126
Variance27.880148
MonotonicityNot monotonic
2025-11-30T12:34:42.614433image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
160234
59.2%
313954
 
13.7%
612902
 
12.7%
183691
 
3.6%
22128
 
2.1%
221993
 
2.0%
111642
 
1.6%
51184
 
1.2%
25989
 
1.0%
4815
 
0.8%
Other values (16)2234
 
2.2%
ValueCountFrequency (%)
160234
59.2%
22128
 
2.1%
313954
 
13.7%
4815
 
0.8%
51184
 
1.2%
612902
 
12.7%
7623
 
0.6%
8108
 
0.1%
921
 
< 0.1%
106
 
< 0.1%
ValueCountFrequency (%)
28139
 
0.1%
275
 
< 0.1%
25989
 
1.0%
2448
 
< 0.1%
23412
 
0.4%
221993
2.0%
202
 
< 0.1%
198
 
< 0.1%
183691
3.6%
1714
 
< 0.1%

admission_source_id
Real number (ℝ)

Distinct17
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.7544366
Minimum1
Maximum25
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:42.867321image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median7
Q37
95-th percentile17
Maximum25
Range24
Interquartile range (IQR)6

Descriptive statistics

Standard deviation4.0640808
Coefficient of variation (CV)0.70625173
Kurtosis1.7449894
Mean5.7544366
Median Absolute Deviation (MAD)0
Skewness1.0299349
Sum585606
Variance16.516753
MonotonicityNot monotonic
2025-11-30T12:34:43.088384image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
757494
56.5%
129565
29.1%
176781
 
6.7%
43187
 
3.1%
62264
 
2.2%
21104
 
1.1%
5855
 
0.8%
3187
 
0.2%
20161
 
0.2%
9125
 
0.1%
Other values (7)43
 
< 0.1%
ValueCountFrequency (%)
129565
29.1%
21104
 
1.1%
3187
 
0.2%
43187
 
3.1%
5855
 
0.8%
62264
 
2.2%
757494
56.5%
816
 
< 0.1%
9125
 
0.1%
108
 
< 0.1%
ValueCountFrequency (%)
252
 
< 0.1%
2212
 
< 0.1%
20161
 
0.2%
176781
6.7%
142
 
< 0.1%
131
 
< 0.1%
112
 
< 0.1%
108
 
< 0.1%
9125
 
0.1%
816
 
< 0.1%

time_in_hospital
Real number (ℝ)

Distinct14
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.3959869
Minimum1
Maximum14
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:43.286322image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median4
Q36
95-th percentile11
Maximum14
Range13
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.9851078
Coefficient of variation (CV)0.67905293
Kurtosis0.85025084
Mean4.3959869
Median Absolute Deviation (MAD)2
Skewness1.1339987
Sum447362
Variance8.9108684
MonotonicityNot monotonic
2025-11-30T12:34:43.500120image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
317756
17.4%
217224
16.9%
114208
14.0%
413924
13.7%
59966
9.8%
67539
7.4%
75859
 
5.8%
84391
 
4.3%
93002
 
2.9%
102342
 
2.3%
Other values (4)5555
 
5.5%
ValueCountFrequency (%)
114208
14.0%
217224
16.9%
317756
17.4%
413924
13.7%
59966
9.8%
67539
7.4%
75859
 
5.8%
84391
 
4.3%
93002
 
2.9%
102342
 
2.3%
ValueCountFrequency (%)
141042
 
1.0%
131210
 
1.2%
121448
 
1.4%
111855
 
1.8%
102342
 
2.3%
93002
 
2.9%
84391
4.3%
75859
5.8%
67539
7.4%
59966
9.8%

payer_code
Categorical

Distinct18
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
?
40256 
MC
32439 
HM
6274 
SP
5007 
BC
4655 
Other values (13)
13135 

Length

Max length2
Median length2
Mean length1.6044258
Min length1

Characters and Unicode

Total characters163276
Distinct characters17
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row?
2nd row?
3rd row?
4th row?
5th row?

Common Values

ValueCountFrequency (%)
?40256
39.6%
MC32439
31.9%
HM6274
 
6.2%
SP5007
 
4.9%
BC4655
 
4.6%
MD3532
 
3.5%
CP2533
 
2.5%
UN2448
 
2.4%
CM1937
 
1.9%
OG1033
 
1.0%
Other values (8)1652
 
1.6%

Length

2025-11-30T12:34:43.723204image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
40256
39.6%
mc32439
31.9%
hm6274
 
6.2%
sp5007
 
4.9%
bc4655
 
4.6%
md3532
 
3.5%
cp2533
 
2.5%
un2448
 
2.4%
cm1937
 
1.9%
og1033
 
1.0%
Other values (8)1652
 
1.6%

Most occurring characters

ValueCountFrequency (%)
M44810
27.4%
C41845
25.6%
?40256
24.7%
P8211
 
5.0%
H6420
 
3.9%
S5062
 
3.1%
B4655
 
2.9%
D4081
 
2.5%
U2448
 
1.5%
N2448
 
1.5%
Other values (7)3040
 
1.9%

Most occurring categories

ValueCountFrequency (%)
(unknown)163276
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
M44810
27.4%
C41845
25.6%
?40256
24.7%
P8211
 
5.0%
H6420
 
3.9%
S5062
 
3.1%
B4655
 
2.9%
D4081
 
2.5%
U2448
 
1.5%
N2448
 
1.5%
Other values (7)3040
 
1.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown)163276
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
M44810
27.4%
C41845
25.6%
?40256
24.7%
P8211
 
5.0%
H6420
 
3.9%
S5062
 
3.1%
B4655
 
2.9%
D4081
 
2.5%
U2448
 
1.5%
N2448
 
1.5%
Other values (7)3040
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown)163276
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
M44810
27.4%
C41845
25.6%
?40256
24.7%
P8211
 
5.0%
H6420
 
3.9%
S5062
 
3.1%
B4655
 
2.9%
D4081
 
2.5%
U2448
 
1.5%
N2448
 
1.5%
Other values (7)3040
 
1.9%
Distinct73
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:44.140541image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length36
Median length33
Mean length8.6126702
Min length1

Characters and Unicode

Total characters876477
Distinct characters44
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)< 0.1%

Sample

1st rowPediatrics-Endocrinology
2nd row?
3rd row?
4th row?
5th row?
ValueCountFrequency (%)
49949
49.1%
internalmedicine14635
 
14.4%
emergency/trauma7565
 
7.4%
family/generalpractice7440
 
7.3%
cardiology5352
 
5.3%
surgery-general3099
 
3.0%
nephrology1613
 
1.6%
orthopedics1400
 
1.4%
orthopedics-reconstructive1233
 
1.2%
radiologist1140
 
1.1%
Other values (63)8340
 
8.2%
2025-11-30T12:34:44.804965image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e105151
 
12.0%
r76899
 
8.8%
a71149
 
8.1%
n68798
 
7.8%
i63308
 
7.2%
c50007
 
5.7%
?49949
 
5.7%
l48871
 
5.6%
y34937
 
4.0%
t34149
 
3.9%
Other values (34)273259
31.2%

Most occurring categories

ValueCountFrequency (%)
(unknown)876477
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e105151
 
12.0%
r76899
 
8.8%
a71149
 
8.1%
n68798
 
7.8%
i63308
 
7.2%
c50007
 
5.7%
?49949
 
5.7%
l48871
 
5.6%
y34937
 
4.0%
t34149
 
3.9%
Other values (34)273259
31.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown)876477
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e105151
 
12.0%
r76899
 
8.8%
a71149
 
8.1%
n68798
 
7.8%
i63308
 
7.2%
c50007
 
5.7%
?49949
 
5.7%
l48871
 
5.6%
y34937
 
4.0%
t34149
 
3.9%
Other values (34)273259
31.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown)876477
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e105151
 
12.0%
r76899
 
8.8%
a71149
 
8.1%
n68798
 
7.8%
i63308
 
7.2%
c50007
 
5.7%
?49949
 
5.7%
l48871
 
5.6%
y34937
 
4.0%
t34149
 
3.9%
Other values (34)273259
31.2%

num_lab_procedures
Real number (ℝ)

Distinct118
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43.095641
Minimum1
Maximum132
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:45.023880image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q131
median44
Q357
95-th percentile73
Maximum132
Range131
Interquartile range (IQR)26

Descriptive statistics

Standard deviation19.674362
Coefficient of variation (CV)0.45652789
Kurtosis-0.24507352
Mean43.095641
Median Absolute Deviation (MAD)13
Skewness-0.23654392
Sum4385671
Variance387.08053
MonotonicityNot monotonic
2025-11-30T12:34:45.260253image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
13208
 
3.2%
432804
 
2.8%
442496
 
2.5%
452376
 
2.3%
382213
 
2.2%
402201
 
2.2%
462189
 
2.2%
412117
 
2.1%
422113
 
2.1%
472106
 
2.1%
Other values (108)77943
76.6%
ValueCountFrequency (%)
13208
3.2%
21101
 
1.1%
3668
 
0.7%
4378
 
0.4%
5286
 
0.3%
6282
 
0.3%
7323
 
0.3%
8366
 
0.4%
9933
 
0.9%
10838
 
0.8%
ValueCountFrequency (%)
1321
 
< 0.1%
1291
 
< 0.1%
1261
 
< 0.1%
1211
 
< 0.1%
1201
 
< 0.1%
1181
 
< 0.1%
1142
< 0.1%
1133
< 0.1%
1113
< 0.1%
1094
< 0.1%

num_procedures
Real number (ℝ)

Zeros 

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.3397304
Minimum0
Maximum6
Zeros46652
Zeros (%)45.8%
Negative0
Negative (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:45.428754image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile5
Maximum6
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.705807
Coefficient of variation (CV)1.2732465
Kurtosis0.8571103
Mean1.3397304
Median Absolute Deviation (MAD)1
Skewness1.3164148
Sum136339
Variance2.9097775
MonotonicityNot monotonic
2025-11-30T12:34:45.566831image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
046652
45.8%
120742
20.4%
212717
 
12.5%
39443
 
9.3%
64954
 
4.9%
44180
 
4.1%
53078
 
3.0%
ValueCountFrequency (%)
046652
45.8%
120742
20.4%
212717
 
12.5%
39443
 
9.3%
44180
 
4.1%
53078
 
3.0%
64954
 
4.9%
ValueCountFrequency (%)
64954
 
4.9%
53078
 
3.0%
44180
 
4.1%
39443
 
9.3%
212717
 
12.5%
120742
20.4%
046652
45.8%

num_medications
Real number (ℝ)

Distinct75
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.021844
Minimum1
Maximum81
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:45.787331image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6
Q110
median15
Q320
95-th percentile31
Maximum81
Range80
Interquartile range (IQR)10

Descriptive statistics

Standard deviation8.1275662
Coefficient of variation (CV)0.50728032
Kurtosis3.4681549
Mean16.021844
Median Absolute Deviation (MAD)5
Skewness1.3266721
Sum1630479
Variance66.057332
MonotonicityNot monotonic
2025-11-30T12:34:46.073096image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
136086
 
6.0%
126004
 
5.9%
115795
 
5.7%
155792
 
5.7%
145707
 
5.6%
165430
 
5.3%
105346
 
5.3%
174919
 
4.8%
94913
 
4.8%
184523
 
4.4%
Other values (65)47251
46.4%
ValueCountFrequency (%)
1262
 
0.3%
2470
 
0.5%
3900
 
0.9%
41417
 
1.4%
52017
 
2.0%
62699
2.7%
73484
3.4%
84353
4.3%
94913
4.8%
105346
5.3%
ValueCountFrequency (%)
811
 
< 0.1%
791
 
< 0.1%
752
 
< 0.1%
741
 
< 0.1%
723
< 0.1%
702
 
< 0.1%
695
< 0.1%
687
< 0.1%
677
< 0.1%
665
< 0.1%

number_outpatient
Real number (ℝ)

Zeros 

Distinct39
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.36935715
Minimum0
Maximum42
Zeros85027
Zeros (%)83.6%
Negative0
Negative (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:46.438563image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2
Maximum42
Range42
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.2672651
Coefficient of variation (CV)3.4310019
Kurtosis147.90774
Mean0.36935715
Median Absolute Deviation (MAD)0
Skewness8.8329589
Sum37588
Variance1.6059608
MonotonicityNot monotonic
2025-11-30T12:34:46.639179image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
085027
83.6%
18547
 
8.4%
23594
 
3.5%
32042
 
2.0%
41099
 
1.1%
5533
 
0.5%
6303
 
0.3%
7155
 
0.2%
898
 
0.1%
983
 
0.1%
Other values (29)285
 
0.3%
ValueCountFrequency (%)
085027
83.6%
18547
 
8.4%
23594
 
3.5%
32042
 
2.0%
41099
 
1.1%
5533
 
0.5%
6303
 
0.3%
7155
 
0.2%
898
 
0.1%
983
 
0.1%
ValueCountFrequency (%)
421
< 0.1%
401
< 0.1%
391
< 0.1%
381
< 0.1%
371
< 0.1%
362
< 0.1%
352
< 0.1%
341
< 0.1%
332
< 0.1%
292
< 0.1%

number_emergency
Real number (ℝ)

Skewed  Zeros 

Distinct33
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.19783621
Minimum0
Maximum76
Zeros90383
Zeros (%)88.8%
Negative0
Negative (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:46.860679image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum76
Range76
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.93047227
Coefficient of variation (CV)4.7032455
Kurtosis1191.6867
Mean0.19783621
Median Absolute Deviation (MAD)0
Skewness22.855582
Sum20133
Variance0.86577864
MonotonicityNot monotonic
2025-11-30T12:34:47.085713image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
090383
88.8%
17677
 
7.5%
22042
 
2.0%
3725
 
0.7%
4374
 
0.4%
5192
 
0.2%
694
 
0.1%
773
 
0.1%
850
 
< 0.1%
1034
 
< 0.1%
Other values (23)122
 
0.1%
ValueCountFrequency (%)
090383
88.8%
17677
 
7.5%
22042
 
2.0%
3725
 
0.7%
4374
 
0.4%
5192
 
0.2%
694
 
0.1%
773
 
0.1%
850
 
< 0.1%
933
 
< 0.1%
ValueCountFrequency (%)
761
< 0.1%
641
< 0.1%
631
< 0.1%
541
< 0.1%
461
< 0.1%
421
< 0.1%
371
< 0.1%
291
< 0.1%
281
< 0.1%
252
< 0.1%

number_inpatient
Real number (ℝ)

Zeros 

Distinct21
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.63556591
Minimum0
Maximum21
Zeros67630
Zeros (%)66.5%
Negative0
Negative (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:47.277448image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile3
Maximum21
Range21
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.2628633
Coefficient of variation (CV)1.9869903
Kurtosis20.719397
Mean0.63556591
Median Absolute Deviation (MAD)0
Skewness3.614139
Sum64679
Variance1.5948237
MonotonicityNot monotonic
2025-11-30T12:34:47.481352image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
067630
66.5%
119521
 
19.2%
27566
 
7.4%
33411
 
3.4%
41622
 
1.6%
5812
 
0.8%
6480
 
0.5%
7268
 
0.3%
8151
 
0.1%
9111
 
0.1%
Other values (11)194
 
0.2%
ValueCountFrequency (%)
067630
66.5%
119521
 
19.2%
27566
 
7.4%
33411
 
3.4%
41622
 
1.6%
5812
 
0.8%
6480
 
0.5%
7268
 
0.3%
8151
 
0.1%
9111
 
0.1%
ValueCountFrequency (%)
211
 
< 0.1%
192
 
< 0.1%
181
 
< 0.1%
171
 
< 0.1%
166
 
< 0.1%
159
 
< 0.1%
1410
 
< 0.1%
1320
< 0.1%
1234
< 0.1%
1149
< 0.1%

diag_1
Text

Distinct717
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:48.556528image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.1752157
Min length1

Characters and Unicode

Total characters323129
Distinct characters14
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique82 ?
Unique (%)0.1%

Sample

1st row250.83
2nd row276
3rd row648
4th row8
5th row197
ValueCountFrequency (%)
4286862
 
6.7%
4146581
 
6.5%
7864016
 
3.9%
4103614
 
3.6%
4863508
 
3.4%
4272766
 
2.7%
4912275
 
2.2%
7152151
 
2.1%
6822042
 
2.0%
4342028
 
2.0%
Other values (707)65923
64.8%
2025-11-30T12:34:49.813755image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
455457
17.2%
239876
12.3%
837949
11.7%
537131
11.5%
728668
8.9%
128106
8.7%
024960
7.7%
623198
7.2%
919978
 
6.2%
317618
 
5.5%
Other values (4)10188
 
3.2%

Most occurring categories

ValueCountFrequency (%)
(unknown)323129
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
455457
17.2%
239876
12.3%
837949
11.7%
537131
11.5%
728668
8.9%
128106
8.7%
024960
7.7%
623198
7.2%
919978
 
6.2%
317618
 
5.5%
Other values (4)10188
 
3.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown)323129
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
455457
17.2%
239876
12.3%
837949
11.7%
537131
11.5%
728668
8.9%
128106
8.7%
024960
7.7%
623198
7.2%
919978
 
6.2%
317618
 
5.5%
Other values (4)10188
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown)323129
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
455457
17.2%
239876
12.3%
837949
11.7%
537131
11.5%
728668
8.9%
128106
8.7%
024960
7.7%
623198
7.2%
919978
 
6.2%
317618
 
5.5%
Other values (4)10188
 
3.2%

diag_2
Text

Distinct749
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:50.828059image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.166195
Min length1

Characters and Unicode

Total characters322211
Distinct characters14
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique124 ?
Unique (%)0.1%

Sample

1st row?
2nd row250.01
3rd row250
4th row250.43
5th row157
ValueCountFrequency (%)
2766752
 
6.6%
4286662
 
6.5%
2506071
 
6.0%
4275036
 
4.9%
4013736
 
3.7%
4963305
 
3.2%
5993288
 
3.2%
4032823
 
2.8%
4142650
 
2.6%
4112566
 
2.5%
Other values (739)58877
57.9%
2025-11-30T12:34:52.070906image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
451155
15.9%
249765
15.4%
538176
11.8%
034046
10.6%
828711
8.9%
728654
8.9%
126158
8.1%
921842
6.8%
619990
 
6.2%
314097
 
4.4%
Other values (4)9617
 
3.0%

Most occurring categories

ValueCountFrequency (%)
(unknown)322211
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
451155
15.9%
249765
15.4%
538176
11.8%
034046
10.6%
828711
8.9%
728654
8.9%
126158
8.1%
921842
6.8%
619990
 
6.2%
314097
 
4.4%
Other values (4)9617
 
3.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown)322211
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
451155
15.9%
249765
15.4%
538176
11.8%
034046
10.6%
828711
8.9%
728654
8.9%
126158
8.1%
921842
6.8%
619990
 
6.2%
314097
 
4.4%
Other values (4)9617
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown)322211
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
451155
15.9%
249765
15.4%
538176
11.8%
034046
10.6%
828711
8.9%
728654
8.9%
126158
8.1%
921842
6.8%
619990
 
6.2%
314097
 
4.4%
Other values (4)9617
 
3.0%

diag_3
Text

Distinct790
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:53.171159image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.1116581
Min length1

Characters and Unicode

Total characters316661
Distinct characters14
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique122 ?
Unique (%)0.1%

Sample

1st row?
2nd row255
3rd rowV27
4th row403
5th row250
ValueCountFrequency (%)
25011555
 
11.4%
4018289
 
8.1%
2765175
 
5.1%
4284577
 
4.5%
4273955
 
3.9%
4143664
 
3.6%
4962605
 
2.6%
4032357
 
2.3%
5851992
 
2.0%
2721969
 
1.9%
Other values (780)55628
54.7%
2025-11-30T12:34:54.549822image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
251244
16.2%
449252
15.6%
541260
13.0%
039711
12.5%
726504
8.4%
124684
7.8%
823825
7.5%
917323
 
5.5%
616441
 
5.2%
314333
 
4.5%
Other values (4)12084
 
3.8%

Most occurring categories

ValueCountFrequency (%)
(unknown)316661
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
251244
16.2%
449252
15.6%
541260
13.0%
039711
12.5%
726504
8.4%
124684
7.8%
823825
7.5%
917323
 
5.5%
616441
 
5.2%
314333
 
4.5%
Other values (4)12084
 
3.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown)316661
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
251244
16.2%
449252
15.6%
541260
13.0%
039711
12.5%
726504
8.4%
124684
7.8%
823825
7.5%
917323
 
5.5%
616441
 
5.2%
314333
 
4.5%
Other values (4)12084
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown)316661
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
251244
16.2%
449252
15.6%
541260
13.0%
039711
12.5%
726504
8.4%
124684
7.8%
823825
7.5%
917323
 
5.5%
616441
 
5.2%
314333
 
4.5%
Other values (4)12084
 
3.8%

number_diagnoses
Real number (ℝ)

Distinct16
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.4226068
Minimum1
Maximum16
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size795.2 KiB
2025-11-30T12:34:54.727880image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q16
median8
Q39
95-th percentile9
Maximum16
Range15
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.9336001
Coefficient of variation (CV)0.26050149
Kurtosis-0.079056024
Mean7.4226068
Median Absolute Deviation (MAD)1
Skewness-0.87674624
Sum755369
Variance3.7388095
MonotonicityNot monotonic
2025-11-30T12:34:54.908792image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
949474
48.6%
511393
 
11.2%
810616
 
10.4%
710393
 
10.2%
610161
 
10.0%
45537
 
5.4%
32835
 
2.8%
21023
 
1.0%
1219
 
0.2%
1645
 
< 0.1%
Other values (6)70
 
0.1%
ValueCountFrequency (%)
1219
 
0.2%
21023
 
1.0%
32835
 
2.8%
45537
 
5.4%
511393
 
11.2%
610161
 
10.0%
710393
 
10.2%
810616
 
10.4%
949474
48.6%
1017
 
< 0.1%
ValueCountFrequency (%)
1645
 
< 0.1%
1510
 
< 0.1%
147
 
< 0.1%
1316
 
< 0.1%
129
 
< 0.1%
1111
 
< 0.1%
1017
 
< 0.1%
949474
48.6%
810616
 
10.4%
710393
 
10.2%

max_glu_serum
Categorical

High correlation  Missing 

Distinct3
Distinct (%)0.1%
Missing96420
Missing (%)94.7%
Memory size795.2 KiB
Norm
2597 
>200
1485 
>300
1264 

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters21384
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row>300
2nd row>300
3rd rowNorm
4th rowNorm
5th rowNorm

Common Values

ValueCountFrequency (%)
Norm2597
 
2.6%
>2001485
 
1.5%
>3001264
 
1.2%
(Missing)96420
94.7%

Length

2025-11-30T12:34:55.145071image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:34:55.311237image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
norm2597
48.6%
2001485
27.8%
3001264
23.6%

Most occurring characters

ValueCountFrequency (%)
05498
25.7%
>2749
12.9%
N2597
12.1%
o2597
12.1%
m2597
12.1%
r2597
12.1%
21485
 
6.9%
31264
 
5.9%

Most occurring categories

ValueCountFrequency (%)
(unknown)21384
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
05498
25.7%
>2749
12.9%
N2597
12.1%
o2597
12.1%
m2597
12.1%
r2597
12.1%
21485
 
6.9%
31264
 
5.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown)21384
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
05498
25.7%
>2749
12.9%
N2597
12.1%
o2597
12.1%
m2597
12.1%
r2597
12.1%
21485
 
6.9%
31264
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown)21384
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
05498
25.7%
>2749
12.9%
N2597
12.1%
o2597
12.1%
m2597
12.1%
r2597
12.1%
21485
 
6.9%
31264
 
5.9%

A1Cresult
Categorical

High correlation  Missing 

Distinct3
Distinct (%)< 0.1%
Missing84748
Missing (%)83.3%
Memory size795.2 KiB
>8
8216 
Norm
4990 
>7
3812 

Length

Max length4
Median length2
Mean length2.5864379
Min length2

Characters and Unicode

Total characters44016
Distinct characters7
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row>7
2nd row>7
3rd row>8
4th rowNorm
5th rowNorm

Common Values

ValueCountFrequency (%)
>88216
 
8.1%
Norm4990
 
4.9%
>73812
 
3.7%
(Missing)84748
83.3%

Length

2025-11-30T12:34:55.611029image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:34:55.809071image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
88216
48.3%
norm4990
29.3%
73812
22.4%

Most occurring characters

ValueCountFrequency (%)
>12028
27.3%
88216
18.7%
N4990
11.3%
o4990
11.3%
r4990
11.3%
m4990
11.3%
73812
 
8.7%

Most occurring categories

ValueCountFrequency (%)
(unknown)44016
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
>12028
27.3%
88216
18.7%
N4990
11.3%
o4990
11.3%
r4990
11.3%
m4990
11.3%
73812
 
8.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown)44016
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
>12028
27.3%
88216
18.7%
N4990
11.3%
o4990
11.3%
r4990
11.3%
m4990
11.3%
73812
 
8.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown)44016
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
>12028
27.3%
88216
18.7%
N4990
11.3%
o4990
11.3%
r4990
11.3%
m4990
11.3%
73812
 
8.7%

metformin
Categorical

Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
81778 
Steady
18346 
Up
 
1067
Down
 
575

Length

Max length6
Median length2
Mean length2.7324057
Min length2

Characters and Unicode

Total characters278066
Distinct characters13
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No81778
80.4%
Steady18346
 
18.0%
Up1067
 
1.0%
Down575
 
0.6%

Length

2025-11-30T12:34:56.012148image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:34:56.180889image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no81778
80.4%
steady18346
 
18.0%
up1067
 
1.0%
down575
 
0.6%

Most occurring characters

ValueCountFrequency (%)
o82353
29.6%
N81778
29.4%
S18346
 
6.6%
t18346
 
6.6%
e18346
 
6.6%
a18346
 
6.6%
d18346
 
6.6%
y18346
 
6.6%
U1067
 
0.4%
p1067
 
0.4%
Other values (3)1725
 
0.6%

Most occurring categories

ValueCountFrequency (%)
(unknown)278066
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o82353
29.6%
N81778
29.4%
S18346
 
6.6%
t18346
 
6.6%
e18346
 
6.6%
a18346
 
6.6%
d18346
 
6.6%
y18346
 
6.6%
U1067
 
0.4%
p1067
 
0.4%
Other values (3)1725
 
0.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown)278066
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o82353
29.6%
N81778
29.4%
S18346
 
6.6%
t18346
 
6.6%
e18346
 
6.6%
a18346
 
6.6%
d18346
 
6.6%
y18346
 
6.6%
U1067
 
0.4%
p1067
 
0.4%
Other values (3)1725
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown)278066
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o82353
29.6%
N81778
29.4%
S18346
 
6.6%
t18346
 
6.6%
e18346
 
6.6%
a18346
 
6.6%
d18346
 
6.6%
y18346
 
6.6%
U1067
 
0.4%
p1067
 
0.4%
Other values (3)1725
 
0.6%

repaglinide
Categorical

Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
100227 
Steady
 
1384
Up
 
110
Down
 
45

Length

Max length6
Median length2
Mean length2.0552837
Min length2

Characters and Unicode

Total characters209158
Distinct characters13
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No100227
98.5%
Steady1384
 
1.4%
Up110
 
0.1%
Down45
 
< 0.1%

Length

2025-11-30T12:34:56.413849image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:34:56.571842image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no100227
98.5%
steady1384
 
1.4%
up110
 
0.1%
down45
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
o100272
47.9%
N100227
47.9%
S1384
 
0.7%
t1384
 
0.7%
e1384
 
0.7%
a1384
 
0.7%
d1384
 
0.7%
y1384
 
0.7%
U110
 
0.1%
p110
 
0.1%
Other values (3)135
 
0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)209158
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o100272
47.9%
N100227
47.9%
S1384
 
0.7%
t1384
 
0.7%
e1384
 
0.7%
a1384
 
0.7%
d1384
 
0.7%
y1384
 
0.7%
U110
 
0.1%
p110
 
0.1%
Other values (3)135
 
0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)209158
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o100272
47.9%
N100227
47.9%
S1384
 
0.7%
t1384
 
0.7%
e1384
 
0.7%
a1384
 
0.7%
d1384
 
0.7%
y1384
 
0.7%
U110
 
0.1%
p110
 
0.1%
Other values (3)135
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)209158
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o100272
47.9%
N100227
47.9%
S1384
 
0.7%
t1384
 
0.7%
e1384
 
0.7%
a1384
 
0.7%
d1384
 
0.7%
y1384
 
0.7%
U110
 
0.1%
p110
 
0.1%
Other values (3)135
 
0.1%

nateglinide
Categorical

Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101063 
Steady
 
668
Up
 
24
Down
 
11

Length

Max length6
Median length2
Mean length2.0264725
Min length2

Characters and Unicode

Total characters206226
Distinct characters13
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No101063
99.3%
Steady668
 
0.7%
Up24
 
< 0.1%
Down11
 
< 0.1%

Length

2025-11-30T12:34:56.779855image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:34:56.968012image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no101063
99.3%
steady668
 
0.7%
up24
 
< 0.1%
down11
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
o101074
49.0%
N101063
49.0%
S668
 
0.3%
t668
 
0.3%
e668
 
0.3%
a668
 
0.3%
d668
 
0.3%
y668
 
0.3%
U24
 
< 0.1%
p24
 
< 0.1%
Other values (3)33
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)206226
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o101074
49.0%
N101063
49.0%
S668
 
0.3%
t668
 
0.3%
e668
 
0.3%
a668
 
0.3%
d668
 
0.3%
y668
 
0.3%
U24
 
< 0.1%
p24
 
< 0.1%
Other values (3)33
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)206226
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o101074
49.0%
N101063
49.0%
S668
 
0.3%
t668
 
0.3%
e668
 
0.3%
a668
 
0.3%
d668
 
0.3%
y668
 
0.3%
U24
 
< 0.1%
p24
 
< 0.1%
Other values (3)33
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)206226
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o101074
49.0%
N101063
49.0%
S668
 
0.3%
t668
 
0.3%
e668
 
0.3%
a668
 
0.3%
d668
 
0.3%
y668
 
0.3%
U24
 
< 0.1%
p24
 
< 0.1%
Other values (3)33
 
< 0.1%

chlorpropamide
Categorical

Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101680 
Steady
 
79
Up
 
6
Down
 
1

Length

Max length6
Median length2
Mean length2.0031248
Min length2

Characters and Unicode

Total characters203850
Distinct characters13
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No101680
99.9%
Steady79
 
0.1%
Up6
 
< 0.1%
Down1
 
< 0.1%

Length

2025-11-30T12:34:57.172472image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:34:57.367644image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no101680
99.9%
steady79
 
0.1%
up6
 
< 0.1%
down1
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
o101681
49.9%
N101680
49.9%
S79
 
< 0.1%
t79
 
< 0.1%
e79
 
< 0.1%
a79
 
< 0.1%
d79
 
< 0.1%
y79
 
< 0.1%
U6
 
< 0.1%
p6
 
< 0.1%
Other values (3)3
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)203850
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o101681
49.9%
N101680
49.9%
S79
 
< 0.1%
t79
 
< 0.1%
e79
 
< 0.1%
a79
 
< 0.1%
d79
 
< 0.1%
y79
 
< 0.1%
U6
 
< 0.1%
p6
 
< 0.1%
Other values (3)3
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)203850
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o101681
49.9%
N101680
49.9%
S79
 
< 0.1%
t79
 
< 0.1%
e79
 
< 0.1%
a79
 
< 0.1%
d79
 
< 0.1%
y79
 
< 0.1%
U6
 
< 0.1%
p6
 
< 0.1%
Other values (3)3
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)203850
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o101681
49.9%
N101680
49.9%
S79
 
< 0.1%
t79
 
< 0.1%
e79
 
< 0.1%
a79
 
< 0.1%
d79
 
< 0.1%
y79
 
< 0.1%
U6
 
< 0.1%
p6
 
< 0.1%
Other values (3)3
 
< 0.1%

glimepiride
Categorical

Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
96575 
Steady
 
4670
Up
 
327
Down
 
194

Length

Max length6
Median length2
Mean length2.187371
Min length2

Characters and Unicode

Total characters222600
Distinct characters13
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No96575
94.9%
Steady4670
 
4.6%
Up327
 
0.3%
Down194
 
0.2%

Length

2025-11-30T12:34:57.580514image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:34:57.758135image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no96575
94.9%
steady4670
 
4.6%
up327
 
0.3%
down194
 
0.2%

Most occurring characters

ValueCountFrequency (%)
o96769
43.5%
N96575
43.4%
S4670
 
2.1%
t4670
 
2.1%
e4670
 
2.1%
a4670
 
2.1%
d4670
 
2.1%
y4670
 
2.1%
U327
 
0.1%
p327
 
0.1%
Other values (3)582
 
0.3%

Most occurring categories

ValueCountFrequency (%)
(unknown)222600
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o96769
43.5%
N96575
43.4%
S4670
 
2.1%
t4670
 
2.1%
e4670
 
2.1%
a4670
 
2.1%
d4670
 
2.1%
y4670
 
2.1%
U327
 
0.1%
p327
 
0.1%
Other values (3)582
 
0.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown)222600
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o96769
43.5%
N96575
43.4%
S4670
 
2.1%
t4670
 
2.1%
e4670
 
2.1%
a4670
 
2.1%
d4670
 
2.1%
y4670
 
2.1%
U327
 
0.1%
p327
 
0.1%
Other values (3)582
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown)222600
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o96769
43.5%
N96575
43.4%
S4670
 
2.1%
t4670
 
2.1%
e4670
 
2.1%
a4670
 
2.1%
d4670
 
2.1%
y4670
 
2.1%
U327
 
0.1%
p327
 
0.1%
Other values (3)582
 
0.3%

acetohexamide
Categorical

High correlation  Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101765 
Steady
 
1

Length

Max length6
Median length2
Mean length2.0000393
Min length2

Characters and Unicode

Total characters203536
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No101765
> 99.9%
Steady1
 
< 0.1%

Length

2025-11-30T12:34:58.597361image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:34:58.765841image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no101765
> 99.9%
steady1
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
N101765
50.0%
o101765
50.0%
S1
 
< 0.1%
t1
 
< 0.1%
e1
 
< 0.1%
a1
 
< 0.1%
d1
 
< 0.1%
y1
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)203536
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
N101765
50.0%
o101765
50.0%
S1
 
< 0.1%
t1
 
< 0.1%
e1
 
< 0.1%
a1
 
< 0.1%
d1
 
< 0.1%
y1
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)203536
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
N101765
50.0%
o101765
50.0%
S1
 
< 0.1%
t1
 
< 0.1%
e1
 
< 0.1%
a1
 
< 0.1%
d1
 
< 0.1%
y1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)203536
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
N101765
50.0%
o101765
50.0%
S1
 
< 0.1%
t1
 
< 0.1%
e1
 
< 0.1%
a1
 
< 0.1%
d1
 
< 0.1%
y1
 
< 0.1%

glipizide
Categorical

Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
89080 
Steady
11356 
Up
 
770
Down
 
560

Length

Max length6
Median length2
Mean length2.457363
Min length2

Characters and Unicode

Total characters250076
Distinct characters13
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowSteady
4th rowNo
5th rowSteady

Common Values

ValueCountFrequency (%)
No89080
87.5%
Steady11356
 
11.2%
Up770
 
0.8%
Down560
 
0.6%

Length

2025-11-30T12:34:58.951721image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:34:59.109704image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no89080
87.5%
steady11356
 
11.2%
up770
 
0.8%
down560
 
0.6%

Most occurring characters

ValueCountFrequency (%)
o89640
35.8%
N89080
35.6%
S11356
 
4.5%
t11356
 
4.5%
e11356
 
4.5%
a11356
 
4.5%
d11356
 
4.5%
y11356
 
4.5%
U770
 
0.3%
p770
 
0.3%
Other values (3)1680
 
0.7%

Most occurring categories

ValueCountFrequency (%)
(unknown)250076
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o89640
35.8%
N89080
35.6%
S11356
 
4.5%
t11356
 
4.5%
e11356
 
4.5%
a11356
 
4.5%
d11356
 
4.5%
y11356
 
4.5%
U770
 
0.3%
p770
 
0.3%
Other values (3)1680
 
0.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown)250076
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o89640
35.8%
N89080
35.6%
S11356
 
4.5%
t11356
 
4.5%
e11356
 
4.5%
a11356
 
4.5%
d11356
 
4.5%
y11356
 
4.5%
U770
 
0.3%
p770
 
0.3%
Other values (3)1680
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown)250076
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o89640
35.8%
N89080
35.6%
S11356
 
4.5%
t11356
 
4.5%
e11356
 
4.5%
a11356
 
4.5%
d11356
 
4.5%
y11356
 
4.5%
U770
 
0.3%
p770
 
0.3%
Other values (3)1680
 
0.7%

glyburide
Categorical

Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
91116 
Steady
9274 
Up
 
812
Down
 
564

Length

Max length6
Median length2
Mean length2.3756068
Min length2

Characters and Unicode

Total characters241756
Distinct characters13
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No91116
89.5%
Steady9274
 
9.1%
Up812
 
0.8%
Down564
 
0.6%

Length

2025-11-30T12:34:59.323585image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:34:59.549144image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no91116
89.5%
steady9274
 
9.1%
up812
 
0.8%
down564
 
0.6%

Most occurring characters

ValueCountFrequency (%)
o91680
37.9%
N91116
37.7%
S9274
 
3.8%
t9274
 
3.8%
e9274
 
3.8%
a9274
 
3.8%
d9274
 
3.8%
y9274
 
3.8%
U812
 
0.3%
p812
 
0.3%
Other values (3)1692
 
0.7%

Most occurring categories

ValueCountFrequency (%)
(unknown)241756
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o91680
37.9%
N91116
37.7%
S9274
 
3.8%
t9274
 
3.8%
e9274
 
3.8%
a9274
 
3.8%
d9274
 
3.8%
y9274
 
3.8%
U812
 
0.3%
p812
 
0.3%
Other values (3)1692
 
0.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown)241756
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o91680
37.9%
N91116
37.7%
S9274
 
3.8%
t9274
 
3.8%
e9274
 
3.8%
a9274
 
3.8%
d9274
 
3.8%
y9274
 
3.8%
U812
 
0.3%
p812
 
0.3%
Other values (3)1692
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown)241756
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o91680
37.9%
N91116
37.7%
S9274
 
3.8%
t9274
 
3.8%
e9274
 
3.8%
a9274
 
3.8%
d9274
 
3.8%
y9274
 
3.8%
U812
 
0.3%
p812
 
0.3%
Other values (3)1692
 
0.7%

tolbutamide
Categorical

Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101743 
Steady
 
23

Length

Max length6
Median length2
Mean length2.000904
Min length2

Characters and Unicode

Total characters203624
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No101743
> 99.9%
Steady23
 
< 0.1%

Length

2025-11-30T12:34:59.787787image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:34:59.937781image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no101743
> 99.9%
steady23
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
N101743
50.0%
o101743
50.0%
S23
 
< 0.1%
t23
 
< 0.1%
e23
 
< 0.1%
a23
 
< 0.1%
d23
 
< 0.1%
y23
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)203624
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
N101743
50.0%
o101743
50.0%
S23
 
< 0.1%
t23
 
< 0.1%
e23
 
< 0.1%
a23
 
< 0.1%
d23
 
< 0.1%
y23
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)203624
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
N101743
50.0%
o101743
50.0%
S23
 
< 0.1%
t23
 
< 0.1%
e23
 
< 0.1%
a23
 
< 0.1%
d23
 
< 0.1%
y23
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)203624
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
N101743
50.0%
o101743
50.0%
S23
 
< 0.1%
t23
 
< 0.1%
e23
 
< 0.1%
a23
 
< 0.1%
d23
 
< 0.1%
y23
 
< 0.1%

pioglitazone
Categorical

Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
94438 
Steady
 
6976
Up
 
234
Down
 
118

Length

Max length6
Median length2
Mean length2.2765167
Min length2

Characters and Unicode

Total characters231672
Distinct characters13
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No94438
92.8%
Steady6976
 
6.9%
Up234
 
0.2%
Down118
 
0.1%

Length

2025-11-30T12:35:00.155000image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:35:00.333131image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no94438
92.8%
steady6976
 
6.9%
up234
 
0.2%
down118
 
0.1%

Most occurring characters

ValueCountFrequency (%)
o94556
40.8%
N94438
40.8%
S6976
 
3.0%
t6976
 
3.0%
e6976
 
3.0%
a6976
 
3.0%
d6976
 
3.0%
y6976
 
3.0%
U234
 
0.1%
p234
 
0.1%
Other values (3)354
 
0.2%

Most occurring categories

ValueCountFrequency (%)
(unknown)231672
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o94556
40.8%
N94438
40.8%
S6976
 
3.0%
t6976
 
3.0%
e6976
 
3.0%
a6976
 
3.0%
d6976
 
3.0%
y6976
 
3.0%
U234
 
0.1%
p234
 
0.1%
Other values (3)354
 
0.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown)231672
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o94556
40.8%
N94438
40.8%
S6976
 
3.0%
t6976
 
3.0%
e6976
 
3.0%
a6976
 
3.0%
d6976
 
3.0%
y6976
 
3.0%
U234
 
0.1%
p234
 
0.1%
Other values (3)354
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown)231672
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o94556
40.8%
N94438
40.8%
S6976
 
3.0%
t6976
 
3.0%
e6976
 
3.0%
a6976
 
3.0%
d6976
 
3.0%
y6976
 
3.0%
U234
 
0.1%
p234
 
0.1%
Other values (3)354
 
0.2%

rosiglitazone
Categorical

Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
95401 
Steady
 
6100
Up
 
178
Down
 
87

Length

Max length6
Median length2
Mean length2.2414755
Min length2

Characters and Unicode

Total characters228106
Distinct characters13
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No95401
93.7%
Steady6100
 
6.0%
Up178
 
0.2%
Down87
 
0.1%

Length

2025-11-30T12:35:00.563325image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:35:00.738033image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no95401
93.7%
steady6100
 
6.0%
up178
 
0.2%
down87
 
0.1%

Most occurring characters

ValueCountFrequency (%)
o95488
41.9%
N95401
41.8%
S6100
 
2.7%
t6100
 
2.7%
e6100
 
2.7%
a6100
 
2.7%
d6100
 
2.7%
y6100
 
2.7%
U178
 
0.1%
p178
 
0.1%
Other values (3)261
 
0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)228106
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o95488
41.9%
N95401
41.8%
S6100
 
2.7%
t6100
 
2.7%
e6100
 
2.7%
a6100
 
2.7%
d6100
 
2.7%
y6100
 
2.7%
U178
 
0.1%
p178
 
0.1%
Other values (3)261
 
0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)228106
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o95488
41.9%
N95401
41.8%
S6100
 
2.7%
t6100
 
2.7%
e6100
 
2.7%
a6100
 
2.7%
d6100
 
2.7%
y6100
 
2.7%
U178
 
0.1%
p178
 
0.1%
Other values (3)261
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)228106
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o95488
41.9%
N95401
41.8%
S6100
 
2.7%
t6100
 
2.7%
e6100
 
2.7%
a6100
 
2.7%
d6100
 
2.7%
y6100
 
2.7%
U178
 
0.1%
p178
 
0.1%
Other values (3)261
 
0.1%

acarbose
Categorical

Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101458 
Steady
 
295
Up
 
10
Down
 
3

Length

Max length6
Median length2
Mean length2.0116542
Min length2

Characters and Unicode

Total characters204718
Distinct characters13
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No101458
99.7%
Steady295
 
0.3%
Up10
 
< 0.1%
Down3
 
< 0.1%

Length

2025-11-30T12:35:00.964375image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:35:01.163501image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no101458
99.7%
steady295
 
0.3%
up10
 
< 0.1%
down3
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
o101461
49.6%
N101458
49.6%
S295
 
0.1%
t295
 
0.1%
e295
 
0.1%
a295
 
0.1%
d295
 
0.1%
y295
 
0.1%
U10
 
< 0.1%
p10
 
< 0.1%
Other values (3)9
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)204718
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o101461
49.6%
N101458
49.6%
S295
 
0.1%
t295
 
0.1%
e295
 
0.1%
a295
 
0.1%
d295
 
0.1%
y295
 
0.1%
U10
 
< 0.1%
p10
 
< 0.1%
Other values (3)9
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)204718
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o101461
49.6%
N101458
49.6%
S295
 
0.1%
t295
 
0.1%
e295
 
0.1%
a295
 
0.1%
d295
 
0.1%
y295
 
0.1%
U10
 
< 0.1%
p10
 
< 0.1%
Other values (3)9
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)204718
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o101461
49.6%
N101458
49.6%
S295
 
0.1%
t295
 
0.1%
e295
 
0.1%
a295
 
0.1%
d295
 
0.1%
y295
 
0.1%
U10
 
< 0.1%
p10
 
< 0.1%
Other values (3)9
 
< 0.1%

miglitol
Categorical

High correlation  Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101728 
Steady
 
31
Down
 
5
Up
 
2

Length

Max length6
Median length2
Mean length2.0013167
Min length2

Characters and Unicode

Total characters203666
Distinct characters13
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No101728
> 99.9%
Steady31
 
< 0.1%
Down5
 
< 0.1%
Up2
 
< 0.1%

Length

2025-11-30T12:35:01.435952image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:35:01.593889image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no101728
> 99.9%
steady31
 
< 0.1%
down5
 
< 0.1%
up2
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
o101733
50.0%
N101728
49.9%
S31
 
< 0.1%
t31
 
< 0.1%
e31
 
< 0.1%
a31
 
< 0.1%
d31
 
< 0.1%
y31
 
< 0.1%
D5
 
< 0.1%
w5
 
< 0.1%
Other values (3)9
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)203666
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o101733
50.0%
N101728
49.9%
S31
 
< 0.1%
t31
 
< 0.1%
e31
 
< 0.1%
a31
 
< 0.1%
d31
 
< 0.1%
y31
 
< 0.1%
D5
 
< 0.1%
w5
 
< 0.1%
Other values (3)9
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)203666
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o101733
50.0%
N101728
49.9%
S31
 
< 0.1%
t31
 
< 0.1%
e31
 
< 0.1%
a31
 
< 0.1%
d31
 
< 0.1%
y31
 
< 0.1%
D5
 
< 0.1%
w5
 
< 0.1%
Other values (3)9
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)203666
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o101733
50.0%
N101728
49.9%
S31
 
< 0.1%
t31
 
< 0.1%
e31
 
< 0.1%
a31
 
< 0.1%
d31
 
< 0.1%
y31
 
< 0.1%
D5
 
< 0.1%
w5
 
< 0.1%
Other values (3)9
 
< 0.1%

troglitazone
Categorical

High correlation  Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101763 
Steady
 
3

Length

Max length6
Median length2
Mean length2.0001179
Min length2

Characters and Unicode

Total characters203544
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No101763
> 99.9%
Steady3
 
< 0.1%

Length

2025-11-30T12:35:01.786015image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:35:01.934213image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no101763
> 99.9%
steady3
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
N101763
50.0%
o101763
50.0%
S3
 
< 0.1%
t3
 
< 0.1%
e3
 
< 0.1%
a3
 
< 0.1%
d3
 
< 0.1%
y3
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)203544
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
N101763
50.0%
o101763
50.0%
S3
 
< 0.1%
t3
 
< 0.1%
e3
 
< 0.1%
a3
 
< 0.1%
d3
 
< 0.1%
y3
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)203544
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
N101763
50.0%
o101763
50.0%
S3
 
< 0.1%
t3
 
< 0.1%
e3
 
< 0.1%
a3
 
< 0.1%
d3
 
< 0.1%
y3
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)203544
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
N101763
50.0%
o101763
50.0%
S3
 
< 0.1%
t3
 
< 0.1%
e3
 
< 0.1%
a3
 
< 0.1%
d3
 
< 0.1%
y3
 
< 0.1%

tolazamide
Categorical

Imbalance 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101727 
Steady
 
38
Up
 
1

Length

Max length6
Median length2
Mean length2.0014936
Min length2

Characters and Unicode

Total characters203684
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No101727
> 99.9%
Steady38
 
< 0.1%
Up1
 
< 0.1%

Length

2025-11-30T12:35:02.116033image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:35:02.297644image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no101727
> 99.9%
steady38
 
< 0.1%
up1
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
N101727
49.9%
o101727
49.9%
S38
 
< 0.1%
t38
 
< 0.1%
e38
 
< 0.1%
a38
 
< 0.1%
d38
 
< 0.1%
y38
 
< 0.1%
U1
 
< 0.1%
p1
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)203684
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
N101727
49.9%
o101727
49.9%
S38
 
< 0.1%
t38
 
< 0.1%
e38
 
< 0.1%
a38
 
< 0.1%
d38
 
< 0.1%
y38
 
< 0.1%
U1
 
< 0.1%
p1
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)203684
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
N101727
49.9%
o101727
49.9%
S38
 
< 0.1%
t38
 
< 0.1%
e38
 
< 0.1%
a38
 
< 0.1%
d38
 
< 0.1%
y38
 
< 0.1%
U1
 
< 0.1%
p1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)203684
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
N101727
49.9%
o101727
49.9%
S38
 
< 0.1%
t38
 
< 0.1%
e38
 
< 0.1%
a38
 
< 0.1%
d38
 
< 0.1%
y38
 
< 0.1%
U1
 
< 0.1%
p1
 
< 0.1%

examide
Boolean

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size99.5 KiB
False
101766 
ValueCountFrequency (%)
False101766
100.0%
2025-11-30T12:35:02.386109image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

citoglipton
Boolean

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size99.5 KiB
False
101766 
ValueCountFrequency (%)
False101766
100.0%
2025-11-30T12:35:02.472331image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

insulin
Categorical

High correlation 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
47383 
Steady
30849 
Down
12218 
Up
11316 

Length

Max length6
Median length2
Mean length3.4526659
Min length2

Characters and Unicode

Total characters351364
Distinct characters13
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowUp
3rd rowNo
4th rowUp
5th rowSteady

Common Values

ValueCountFrequency (%)
No47383
46.6%
Steady30849
30.3%
Down12218
 
12.0%
Up11316
 
11.1%

Length

2025-11-30T12:35:02.640085image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:35:02.806710image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no47383
46.6%
steady30849
30.3%
down12218
 
12.0%
up11316
 
11.1%

Most occurring characters

ValueCountFrequency (%)
o59601
17.0%
N47383
13.5%
S30849
8.8%
t30849
8.8%
e30849
8.8%
a30849
8.8%
d30849
8.8%
y30849
8.8%
D12218
 
3.5%
w12218
 
3.5%
Other values (3)34850
9.9%

Most occurring categories

ValueCountFrequency (%)
(unknown)351364
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o59601
17.0%
N47383
13.5%
S30849
8.8%
t30849
8.8%
e30849
8.8%
a30849
8.8%
d30849
8.8%
y30849
8.8%
D12218
 
3.5%
w12218
 
3.5%
Other values (3)34850
9.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown)351364
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o59601
17.0%
N47383
13.5%
S30849
8.8%
t30849
8.8%
e30849
8.8%
a30849
8.8%
d30849
8.8%
y30849
8.8%
D12218
 
3.5%
w12218
 
3.5%
Other values (3)34850
9.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown)351364
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o59601
17.0%
N47383
13.5%
S30849
8.8%
t30849
8.8%
e30849
8.8%
a30849
8.8%
d30849
8.8%
y30849
8.8%
D12218
 
3.5%
w12218
 
3.5%
Other values (3)34850
9.9%

glyburide-metformin
Categorical

Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101060 
Steady
 
692
Up
 
8
Down
 
6

Length

Max length6
Median length2
Mean length2.0273176
Min length2

Characters and Unicode

Total characters206312
Distinct characters13
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No101060
99.3%
Steady692
 
0.7%
Up8
 
< 0.1%
Down6
 
< 0.1%

Length

2025-11-30T12:35:03.025032image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:35:03.173707image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no101060
99.3%
steady692
 
0.7%
up8
 
< 0.1%
down6
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
o101066
49.0%
N101060
49.0%
S692
 
0.3%
t692
 
0.3%
e692
 
0.3%
a692
 
0.3%
d692
 
0.3%
y692
 
0.3%
U8
 
< 0.1%
p8
 
< 0.1%
Other values (3)18
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)206312
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o101066
49.0%
N101060
49.0%
S692
 
0.3%
t692
 
0.3%
e692
 
0.3%
a692
 
0.3%
d692
 
0.3%
y692
 
0.3%
U8
 
< 0.1%
p8
 
< 0.1%
Other values (3)18
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)206312
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o101066
49.0%
N101060
49.0%
S692
 
0.3%
t692
 
0.3%
e692
 
0.3%
a692
 
0.3%
d692
 
0.3%
y692
 
0.3%
U8
 
< 0.1%
p8
 
< 0.1%
Other values (3)18
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)206312
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o101066
49.0%
N101060
49.0%
S692
 
0.3%
t692
 
0.3%
e692
 
0.3%
a692
 
0.3%
d692
 
0.3%
y692
 
0.3%
U8
 
< 0.1%
p8
 
< 0.1%
Other values (3)18
 
< 0.1%

glipizide-metformin
Categorical

High correlation  Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101753 
Steady
 
13

Length

Max length6
Median length2
Mean length2.000511
Min length2

Characters and Unicode

Total characters203584
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No101753
> 99.9%
Steady13
 
< 0.1%

Length

2025-11-30T12:35:03.368550image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:35:03.517372image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no101753
> 99.9%
steady13
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
N101753
50.0%
o101753
50.0%
S13
 
< 0.1%
t13
 
< 0.1%
e13
 
< 0.1%
a13
 
< 0.1%
d13
 
< 0.1%
y13
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)203584
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
N101753
50.0%
o101753
50.0%
S13
 
< 0.1%
t13
 
< 0.1%
e13
 
< 0.1%
a13
 
< 0.1%
d13
 
< 0.1%
y13
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)203584
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
N101753
50.0%
o101753
50.0%
S13
 
< 0.1%
t13
 
< 0.1%
e13
 
< 0.1%
a13
 
< 0.1%
d13
 
< 0.1%
y13
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)203584
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
N101753
50.0%
o101753
50.0%
S13
 
< 0.1%
t13
 
< 0.1%
e13
 
< 0.1%
a13
 
< 0.1%
d13
 
< 0.1%
y13
 
< 0.1%

glimepiride-pioglitazone
Categorical

High correlation  Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101765 
Steady
 
1

Length

Max length6
Median length2
Mean length2.0000393
Min length2

Characters and Unicode

Total characters203536
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No101765
> 99.9%
Steady1
 
< 0.1%

Length

2025-11-30T12:35:03.687933image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:35:03.821376image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no101765
> 99.9%
steady1
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
N101765
50.0%
o101765
50.0%
S1
 
< 0.1%
t1
 
< 0.1%
e1
 
< 0.1%
a1
 
< 0.1%
d1
 
< 0.1%
y1
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)203536
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
N101765
50.0%
o101765
50.0%
S1
 
< 0.1%
t1
 
< 0.1%
e1
 
< 0.1%
a1
 
< 0.1%
d1
 
< 0.1%
y1
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)203536
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
N101765
50.0%
o101765
50.0%
S1
 
< 0.1%
t1
 
< 0.1%
e1
 
< 0.1%
a1
 
< 0.1%
d1
 
< 0.1%
y1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)203536
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
N101765
50.0%
o101765
50.0%
S1
 
< 0.1%
t1
 
< 0.1%
e1
 
< 0.1%
a1
 
< 0.1%
d1
 
< 0.1%
y1
 
< 0.1%

metformin-rosiglitazone
Categorical

High correlation  Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101764 
Steady
 
2

Length

Max length6
Median length2
Mean length2.0000786
Min length2

Characters and Unicode

Total characters203540
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No101764
> 99.9%
Steady2
 
< 0.1%

Length

2025-11-30T12:35:03.976439image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:35:04.110632image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no101764
> 99.9%
steady2
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
N101764
50.0%
o101764
50.0%
S2
 
< 0.1%
t2
 
< 0.1%
e2
 
< 0.1%
a2
 
< 0.1%
d2
 
< 0.1%
y2
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)203540
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
N101764
50.0%
o101764
50.0%
S2
 
< 0.1%
t2
 
< 0.1%
e2
 
< 0.1%
a2
 
< 0.1%
d2
 
< 0.1%
y2
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)203540
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
N101764
50.0%
o101764
50.0%
S2
 
< 0.1%
t2
 
< 0.1%
e2
 
< 0.1%
a2
 
< 0.1%
d2
 
< 0.1%
y2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)203540
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
N101764
50.0%
o101764
50.0%
S2
 
< 0.1%
t2
 
< 0.1%
e2
 
< 0.1%
a2
 
< 0.1%
d2
 
< 0.1%
y2
 
< 0.1%

metformin-pioglitazone
Categorical

High correlation  Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101765 
Steady
 
1

Length

Max length6
Median length2
Mean length2.0000393
Min length2

Characters and Unicode

Total characters203536
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No101765
> 99.9%
Steady1
 
< 0.1%

Length

2025-11-30T12:35:04.265628image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:35:04.397900image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no101765
> 99.9%
steady1
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
N101765
50.0%
o101765
50.0%
S1
 
< 0.1%
t1
 
< 0.1%
e1
 
< 0.1%
a1
 
< 0.1%
d1
 
< 0.1%
y1
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)203536
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
N101765
50.0%
o101765
50.0%
S1
 
< 0.1%
t1
 
< 0.1%
e1
 
< 0.1%
a1
 
< 0.1%
d1
 
< 0.1%
y1
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)203536
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
N101765
50.0%
o101765
50.0%
S1
 
< 0.1%
t1
 
< 0.1%
e1
 
< 0.1%
a1
 
< 0.1%
d1
 
< 0.1%
y1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)203536
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
N101765
50.0%
o101765
50.0%
S1
 
< 0.1%
t1
 
< 0.1%
e1
 
< 0.1%
a1
 
< 0.1%
d1
 
< 0.1%
y1
 
< 0.1%

change
Categorical

High correlation 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
54755 
Ch
47011 

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters203532
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowCh
3rd rowNo
4th rowCh
5th rowCh

Common Values

ValueCountFrequency (%)
No54755
53.8%
Ch47011
46.2%

Length

2025-11-30T12:35:04.570344image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:35:04.724627image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no54755
53.8%
ch47011
46.2%

Most occurring characters

ValueCountFrequency (%)
N54755
26.9%
o54755
26.9%
C47011
23.1%
h47011
23.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)203532
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
N54755
26.9%
o54755
26.9%
C47011
23.1%
h47011
23.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)203532
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
N54755
26.9%
o54755
26.9%
C47011
23.1%
h47011
23.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)203532
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
N54755
26.9%
o54755
26.9%
C47011
23.1%
h47011
23.1%

diabetesMed
Boolean

High correlation 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size99.5 KiB
True
78363 
False
23403 
ValueCountFrequency (%)
True78363
77.0%
False23403
 
23.0%
2025-11-30T12:35:04.811835image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

readmitted
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
NO
54864 
>30
35545 
<30
11357 

Length

Max length3
Median length2
Mean length2.4608808
Min length2

Characters and Unicode

Total characters250434
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNO
2nd row>30
3rd rowNO
4th rowNO
5th rowNO

Common Values

ValueCountFrequency (%)
NO54864
53.9%
>3035545
34.9%
<3011357
 
11.2%

Length

2025-11-30T12:35:04.964493image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-11-30T12:35:05.101782image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
no54864
53.9%
3046902
46.1%

Most occurring characters

ValueCountFrequency (%)
N54864
21.9%
O54864
21.9%
346902
18.7%
046902
18.7%
>35545
14.2%
<11357
 
4.5%

Most occurring categories

ValueCountFrequency (%)
(unknown)250434
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
N54864
21.9%
O54864
21.9%
346902
18.7%
046902
18.7%
>35545
14.2%
<11357
 
4.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown)250434
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
N54864
21.9%
O54864
21.9%
346902
18.7%
046902
18.7%
>35545
14.2%
<11357
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown)250434
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
N54864
21.9%
O54864
21.9%
346902
18.7%
046902
18.7%
>35545
14.2%
<11357
 
4.5%

Interactions

2025-11-30T12:34:29.695206image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:54.372012image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:57.001988image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:59.870100image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:02.598438image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:05.437713image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:08.086796image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:10.895316image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:13.815244image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:16.497352image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:19.789612image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:22.727168image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:25.799060image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:29.942142image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:54.587462image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:57.209116image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:00.089406image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:02.790189image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:05.637178image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:08.306332image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:11.100045image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:14.004611image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:16.703897image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:19.974589image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:22.935766image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:25.995538image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:30.236744image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:54.804589image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:57.466405image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:00.299371image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:03.025833image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:05.845997image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:08.567300image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:11.317709image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:14.237386image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:16.955856image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:20.201447image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:23.153806image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:26.258281image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:30.520068image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:54.996264image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:57.683424image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:00.499091image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:03.224191image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:06.055672image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:08.805415image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:11.618762image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:14.437549image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:17.211690image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:20.427001image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:23.372969image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:26.590281image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:30.775238image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:55.186140image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:57.903933image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:00.719710image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:03.419345image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:06.251080image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:09.018247image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:11.829130image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:14.636086image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:17.434430image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:20.659338image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:23.605389image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:26.897323image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:31.009345image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:55.375961image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:58.114404image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:00.936585image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:03.616011image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:06.441696image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:09.215516image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:12.075744image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:14.833459image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:17.652963image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:20.870991image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:23.856221image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:27.145741image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:31.319140image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:55.565523image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:58.328775image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:01.144134image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:03.814807image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:06.649269image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:09.410892image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:12.298822image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:15.042480image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:18.218373image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:21.099643image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:24.101366image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:27.361307image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:31.573514image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:55.777389image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:58.550360image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:01.350799image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:04.029741image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:06.852994image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:09.621299image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:12.524740image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:15.259162image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:18.445862image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:21.304626image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:24.347586image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:27.628185image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:31.810138image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:55.962334image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:58.765440image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:01.552975image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:04.436682image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:07.054600image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:09.835749image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:12.753325image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:15.456699image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:18.657635image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:21.558012image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:24.638556image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:27.836106image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:32.049528image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:56.157497image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:58.993648image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:01.781911image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:04.646116image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:07.267792image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:10.057117image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:12.988575image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:15.667892image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:18.874884image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:21.910625image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:24.890970image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:28.340414image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:32.286318image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:56.349720image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:59.197771image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:01.984813image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:04.836381image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:07.456131image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:10.257050image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:13.200938image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:15.864105image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:19.078735image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:22.109873image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:25.100038image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:28.745791image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:32.519553image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:56.544624image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:59.422476image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:02.199279image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:05.050901image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:07.664542image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:10.480824image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:13.408861image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:16.079387image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:19.364880image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:22.331411image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:25.328902image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:29.185246image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:32.729534image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:56.776442image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:33:59.639596image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:02.391273image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:05.238589image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:07.860726image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:10.681781image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:13.602219image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:16.282819image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:19.570924image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:22.526494image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:25.550094image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-11-30T12:34:29.421111image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Correlations

2025-11-30T12:35:05.587949image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
A1Cresultacarboseacetohexamideadmission_source_idadmission_type_idagechangechlorpropamidediabetesMeddischarge_disposition_idencounter_idgenderglimepirideglimepiride-pioglitazoneglipizideglipizide-metforminglyburideglyburide-metformininsulinmax_glu_serummetforminmetformin-pioglitazonemetformin-rosiglitazonemiglitolnateglinidenum_lab_proceduresnum_medicationsnum_proceduresnumber_diagnosesnumber_emergencynumber_inpatientnumber_outpatientpatient_nbrpayer_codepioglitazoneracereadmittedrepagliniderosiglitazonetime_in_hospitaltolazamidetolbutamidetroglitazoneweight
A1Cresult1.0000.0091.0000.0420.0690.1830.1870.0030.1820.0400.1320.0330.0321.0000.0400.0170.0320.0000.1530.3700.0541.0000.0000.0000.0000.0300.0290.0240.1120.0070.0220.0190.1180.1580.0140.0640.0190.0270.0150.0220.0030.0031.0000.016
acarbose0.0091.0000.0000.0000.0000.0020.0460.0000.0300.0000.0060.0070.0100.0000.0220.0000.0070.0040.0110.0160.0130.0000.0000.0010.0000.0000.0130.0040.0000.0000.0070.0000.0110.0100.0070.0070.0120.0120.0020.0070.0000.0000.0000.000
acetohexamide1.0000.0001.0000.0000.0000.0000.0000.0000.0000.0200.0000.0000.0000.0000.0000.0000.0000.0000.0001.0000.0000.0000.0000.0000.0000.0040.0190.0130.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0190.0000.0000.0000.000
admission_source_id0.0420.0000.0001.000-0.3830.0350.0230.0000.0180.042-0.0510.0120.0160.0000.0070.0000.0190.0170.0410.1480.0290.0000.0000.0000.0070.136-0.063-0.2050.1060.1040.0560.0240.0300.0810.0170.0740.0560.0180.0210.0030.0000.0040.0000.031
admission_type_id0.0690.0000.000-0.3831.0000.0380.0630.0020.0430.021-0.1230.0130.0360.0000.0120.0000.0070.0270.0640.1230.0320.0000.0000.0050.012-0.2240.0870.217-0.127-0.033-0.0450.0300.0070.1350.0200.0630.0440.0340.019-0.0150.0060.0130.0000.043
age0.1830.0020.0000.0350.0381.0000.0560.0030.0440.0600.0370.0780.0240.0000.0370.0000.0500.0100.0680.1330.0660.0000.0000.0040.0080.0230.0600.0650.1310.0270.0490.0040.0390.1530.0300.0850.0380.0290.0260.0430.0000.0140.0000.026
change0.1870.0460.0000.0230.0630.0561.0000.0120.5060.0810.1200.0140.1440.0000.2090.0070.1910.0430.6410.2480.3290.0000.0000.0140.0550.0700.2440.0270.0570.0150.0170.0150.1300.1480.2030.0210.0460.0780.1960.1150.0000.0000.0030.048
chlorpropamide0.0030.0000.0000.0000.0020.0030.0121.0000.0150.0190.0140.0000.0000.0000.0020.0000.0000.0000.0100.0000.0030.0000.0000.0000.0000.0000.0000.0030.0060.0000.0000.0000.0060.0030.0000.0030.0040.0000.0000.0030.0000.0000.0000.000
diabetesMed0.1820.0300.0000.0180.0430.0440.5060.0151.0000.0830.0680.0150.1270.0000.2060.0040.1870.0450.5850.1910.2700.0000.0000.0090.0450.0430.1960.0300.0320.0070.0180.0010.0680.0950.1520.0220.0610.0680.1410.0700.0100.0070.0000.036
discharge_disposition_id0.0400.0000.0200.0420.0210.0600.0810.0190.0831.000-0.0650.0270.0220.0000.0280.0100.0510.0150.0780.0710.0360.0000.0000.0040.0060.0590.1710.0130.1510.0070.0850.033-0.0460.0940.0240.0280.1200.0160.0170.2760.0170.0100.0110.016
encounter_id0.1320.0060.000-0.051-0.1230.0370.1200.0140.068-0.0651.0000.0110.0300.0010.0230.0040.0540.0290.1020.1770.0280.0140.0110.0050.022-0.0090.102-0.0310.2930.1310.0370.1510.5440.2440.0360.0780.0730.0190.044-0.0600.0140.0100.0130.020
gender0.0330.0070.0000.0120.0130.0780.0140.0000.0150.0270.0111.0000.0000.0000.0190.0050.0230.0000.0000.0000.0000.0000.0020.0040.0000.0170.0360.0450.0000.0000.0080.0000.0220.0610.0040.0540.0130.0000.0110.0280.0030.0000.0040.027
glimepiride0.0320.0100.0000.0160.0360.0240.1440.0000.1270.0220.0300.0001.0000.0000.0420.0000.0400.0040.0100.0000.0280.0000.0000.0130.0090.0190.0290.0070.0100.0090.0000.0000.0230.0330.0260.0140.0070.0030.0250.0250.0000.0000.0050.007
glimepiride-pioglitazone1.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0010.0000.0001.0000.0000.0000.0000.0000.0001.0000.0000.0000.0000.0000.0000.0000.0000.0000.0050.0000.0000.0000.0000.0050.0000.0000.0000.0000.0000.0000.0000.0000.0000.000
glipizide0.0400.0220.0000.0070.0120.0370.2090.0020.2060.0280.0230.0190.0420.0001.0000.0000.0620.0150.0340.0540.0490.0000.0000.0140.0070.0240.0420.0090.0130.0000.0120.0000.0210.0170.0290.0140.0150.0100.0270.0370.0000.0020.0000.013
glipizide-metformin0.0170.0000.0000.0000.0000.0000.0070.0000.0040.0100.0040.0050.0000.0000.0001.0000.0000.0300.0001.0000.0000.0000.0000.0000.0000.0080.0000.0000.0000.0000.0000.0000.0240.0270.0000.0000.0010.0000.0000.0050.0000.0000.0000.000
glyburide0.0320.0070.0000.0190.0070.0500.1910.0000.1870.0510.0540.0230.0400.0000.0620.0001.0000.0040.0540.0320.0930.0000.0000.0000.0110.0200.0300.0070.0240.0000.0200.0050.0440.0400.0160.0170.0040.0140.0250.0330.0000.0000.0000.004
glyburide-metformin0.0000.0040.0000.0170.0270.0100.0430.0000.0450.0150.0290.0000.0040.0000.0150.0300.0041.0000.0050.0290.0120.0000.0000.0000.0040.0060.0030.0000.0120.0200.0000.0000.0320.0390.0180.0180.0040.0030.0020.0030.0000.0000.0000.000
insulin0.1530.0110.0000.0410.0640.0680.6410.0100.5850.0780.1020.0000.0100.0000.0340.0000.0540.0051.0000.2230.0320.0000.0030.0040.0040.0730.1430.0230.0780.0170.0440.0180.1180.1310.0090.0420.0500.0180.0130.0790.0080.0000.0000.055
max_glu_serum0.3700.0161.0000.1480.1230.1330.2480.0000.1910.0710.1770.0000.0001.0000.0541.0000.0320.0290.2231.0000.0481.0001.0001.0000.0240.1500.1370.0360.0540.0140.0780.0000.1570.1140.0130.0000.0540.0250.0180.1380.0000.0001.0001.000
metformin0.0540.0130.0000.0290.0320.0660.3290.0030.2700.0360.0280.0000.0280.0000.0490.0000.0930.0120.0320.0481.0000.0410.0000.0080.0130.0370.0440.0230.0460.0000.0320.0070.0210.0440.0340.0120.0220.0090.0610.0280.0080.0050.0000.014
metformin-pioglitazone1.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0140.0000.0000.0000.0000.0000.0000.0000.0001.0000.0411.0000.0000.0000.0000.0010.0000.0000.0050.0000.0000.0000.0000.0150.0100.0000.0000.0000.0000.0000.0000.0000.0000.000
metformin-rosiglitazone0.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0110.0020.0000.0000.0000.0000.0000.0000.0031.0000.0000.0001.0000.0000.0000.0000.0380.0060.0000.0000.0000.0000.0230.0150.0000.0280.0000.0000.0000.0000.0000.0000.0000.000
miglitol0.0000.0010.0000.0000.0050.0040.0140.0000.0090.0040.0050.0040.0130.0000.0140.0000.0000.0000.0041.0000.0080.0000.0001.0000.0050.0000.0000.0000.0000.0000.0030.0000.0090.0000.0000.0000.0050.0080.0000.0090.0000.0000.0000.000
nateglinide0.0000.0000.0000.0070.0120.0080.0550.0000.0450.0060.0220.0000.0090.0000.0070.0000.0110.0040.0040.0240.0130.0000.0000.0051.0000.0060.0150.0010.0350.0210.0000.0000.0180.0130.0200.0100.0000.0000.0090.0050.0000.0000.0000.004
num_lab_procedures0.0300.0000.0040.136-0.2240.0230.0700.0000.0430.059-0.0090.0170.0190.0000.0240.0080.0200.0060.0730.1500.0370.0010.0000.0000.0061.0000.2520.0230.1690.0060.041-0.0240.0270.0470.0180.0410.0320.0200.0110.3370.0000.0060.0000.038
num_medications0.0290.0130.019-0.0630.0870.0600.2440.0000.1960.1710.1020.0360.0290.0000.0420.0000.0300.0030.1430.1370.0440.0000.0380.0000.0150.2521.0000.3520.2940.0440.0990.0740.0450.0380.0430.0300.0630.0160.0320.4650.0000.0000.0000.008
num_procedures0.0240.0040.013-0.2050.2170.0650.0270.0030.0300.013-0.0310.0450.0070.0000.0090.0000.0070.0000.0230.0360.0230.0000.0060.0000.0010.0230.3521.0000.067-0.046-0.064-0.024-0.0190.0430.0100.0250.0370.0000.0080.1870.0070.0000.0000.011
number_diagnoses0.1120.0000.0000.106-0.1270.1310.0570.0060.0320.1510.2930.0000.0100.0050.0130.0000.0240.0120.0780.0540.0460.0050.0000.0000.0350.1690.2940.0671.0000.0920.1360.1130.2400.0790.0100.0630.0820.0220.0080.2370.0090.0000.0000.022
number_emergency0.0070.0000.0000.104-0.0330.0270.0150.0000.0070.0070.1310.0000.0090.0000.0000.0000.0000.0200.0170.0140.0000.0000.0000.0000.0210.0060.044-0.0460.0921.0000.2220.1770.1130.0340.0000.0040.0290.0000.000-0.0010.0000.0000.0000.000
number_inpatient0.0220.0070.0000.056-0.0450.0490.0170.0000.0180.0850.0370.0080.0000.0000.0120.0000.0200.0000.0440.0780.0320.0000.0000.0030.0000.0410.099-0.0640.1360.2221.0000.1560.0260.0290.0110.0140.1300.0000.0080.0920.0000.0000.0000.014
number_outpatient0.0190.0000.0000.0240.0300.0040.0150.0000.0010.0330.1510.0000.0000.0000.0000.0000.0050.0000.0180.0000.0070.0000.0000.0000.000-0.0240.074-0.0240.1130.1770.1561.0000.1550.0240.0000.0120.0280.0000.000-0.0130.0000.0000.0000.019
patient_nbr0.1180.0110.0000.0300.0070.0390.1300.0060.068-0.0460.5440.0220.0230.0000.0210.0240.0440.0320.1180.1570.0210.0000.0230.0090.0180.0270.045-0.0190.2400.1130.0260.1551.0000.1740.0320.1060.1150.0420.019-0.0170.0090.0000.0000.037
payer_code0.1580.0100.0000.0810.1350.1530.1480.0030.0950.0940.2440.0610.0330.0050.0170.0270.0400.0390.1310.1140.0440.0150.0150.0000.0130.0470.0380.0430.0790.0340.0290.0240.1741.0000.0310.0870.0490.0250.0150.0330.0000.0000.0000.054
pioglitazone0.0140.0070.0000.0170.0200.0300.2030.0000.1520.0240.0360.0040.0260.0000.0290.0000.0160.0180.0090.0130.0340.0100.0000.0000.0200.0180.0430.0100.0100.0000.0110.0000.0320.0311.0000.0150.0110.0150.0370.0230.0000.0000.0000.019
race0.0640.0070.0000.0740.0630.0850.0210.0030.0220.0280.0780.0540.0140.0000.0140.0000.0170.0180.0420.0000.0120.0000.0280.0000.0100.0410.0300.0250.0630.0040.0140.0120.1060.0870.0151.0000.0370.0160.0060.0130.0000.0000.0000.036
readmitted0.0190.0120.0000.0560.0440.0380.0460.0040.0610.1200.0730.0130.0070.0000.0150.0010.0040.0040.0500.0540.0220.0000.0000.0050.0000.0320.0630.0370.0820.0290.1300.0280.1150.0490.0110.0371.0000.0160.0130.0480.0020.0000.0000.035
repaglinide0.0270.0120.0000.0180.0340.0290.0780.0000.0680.0160.0190.0000.0030.0000.0100.0000.0140.0030.0180.0250.0090.0000.0000.0080.0000.0200.0160.0000.0220.0000.0000.0000.0420.0250.0150.0160.0161.0000.0060.0240.0000.0000.0000.000
rosiglitazone0.0150.0020.0000.0210.0190.0260.1960.0000.1410.0170.0440.0110.0250.0000.0270.0000.0250.0020.0130.0180.0610.0000.0000.0000.0090.0110.0320.0080.0080.0000.0080.0000.0190.0150.0370.0060.0130.0061.0000.0210.0000.0000.0030.004
time_in_hospital0.0220.0070.0190.003-0.0150.0430.1150.0030.0700.276-0.0600.0280.0250.0000.0370.0050.0330.0030.0790.1380.0280.0000.0000.0090.0050.3370.4650.1870.237-0.0010.092-0.013-0.0170.0330.0230.0130.0480.0240.0211.0000.0000.0000.0130.010
tolazamide0.0030.0000.0000.0000.0060.0000.0000.0000.0100.0170.0140.0030.0000.0000.0000.0000.0000.0000.0080.0000.0080.0000.0000.0000.0000.0000.0000.0070.0090.0000.0000.0000.0090.0000.0000.0000.0020.0000.0000.0001.0000.0000.0000.000
tolbutamide0.0030.0000.0000.0040.0130.0140.0000.0000.0070.0100.0100.0000.0000.0000.0020.0000.0000.0000.0000.0000.0050.0000.0000.0000.0000.0060.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0001.0000.0000.000
troglitazone1.0000.0000.0000.0000.0000.0000.0030.0000.0000.0110.0130.0040.0050.0000.0000.0000.0000.0000.0001.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0030.0130.0000.0001.0000.000
weight0.0160.0000.0000.0310.0430.0260.0480.0000.0360.0160.0200.0270.0070.0000.0130.0000.0040.0000.0551.0000.0140.0000.0000.0000.0040.0380.0080.0110.0220.0000.0140.0190.0370.0540.0190.0360.0350.0000.0040.0100.0000.0000.0001.000

Missing values

2025-11-30T12:34:33.672834image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
A simple visualization of nullity by column.
2025-11-30T12:34:35.932563image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2025-11-30T12:34:38.000288image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

encounter_idpatient_nbrracegenderageweightadmission_type_iddischarge_disposition_idadmission_source_idtime_in_hospitalpayer_codemedical_specialtynum_lab_proceduresnum_proceduresnum_medicationsnumber_outpatientnumber_emergencynumber_inpatientdiag_1diag_2diag_3number_diagnosesmax_glu_serumA1Cresultmetforminrepaglinidenateglinidechlorpropamideglimepirideacetohexamideglipizideglyburidetolbutamidepioglitazonerosiglitazoneacarbosemiglitoltroglitazonetolazamideexamidecitogliptoninsulinglyburide-metforminglipizide-metforminglimepiride-pioglitazonemetformin-rosiglitazonemetformin-pioglitazonechangediabetesMedreadmitted
022783928222157CaucasianFemale[0-10)?62511?Pediatrics-Endocrinology4101000250.83??1NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNO
114919055629189CaucasianFemale[10-20)?1173??59018000276250.012559NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoUpNoNoNoNoNoChYes>30
26441086047875AfricanAmericanFemale[20-30)?1172??11513201648250V276NaNNaNNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoYesNO
350036482442376CaucasianMale[30-40)?1172??441160008250.434037NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoUpNoNoNoNoNoChYesNO
41668042519267CaucasianMale[40-50)?1171??51080001971572505NaNNaNNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoChYesNO
53575482637451CaucasianMale[50-60)?2123??316160004144112509NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoYes>30
65584284259809CaucasianMale[60-70)?3124??70121000414411V457NaNNaNSteadyNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoChYesNO
763768114882984CaucasianMale[70-80)?1175??730120004284922508NaNNaNNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoYes>30
81252248330783CaucasianFemale[80-90)?21413??68228000398427388NaNNaNNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoChYesNO
91573863555939CaucasianFemale[90-100)?33412?InternalMedicine333180004341984868NaNNaNNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoSteadyNoNoNoNoNoChYesNO
encounter_idpatient_nbrracegenderageweightadmission_type_iddischarge_disposition_idadmission_source_idtime_in_hospitalpayer_codemedical_specialtynum_lab_proceduresnum_proceduresnum_medicationsnumber_outpatientnumber_emergencynumber_inpatientdiag_1diag_2diag_3number_diagnosesmax_glu_serumA1Cresultmetforminrepaglinidenateglinidechlorpropamideglimepirideacetohexamideglipizideglyburidetolbutamidepioglitazonerosiglitazoneacarbosemiglitoltroglitazonetolazamideexamidecitogliptoninsulinglyburide-metforminglipizide-metforminglimepiride-pioglitazonemetformin-rosiglitazonemetformin-pioglitazonechangediabetesMedreadmitted
101756443842070140199494OtherFemale[60-70)?1172MD?466171119965854039NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoYes>30
101757443842136181593374CaucasianFemale[70-80)?1175??211160014915185119NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoYesNO
101758443842340120975314CaucasianFemale[80-90)?1175MC?7612201029283049NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoUpNoNoNoNoNoChYesNO
10175944384277886472243CaucasianMale[80-90)?1171MC?10153004357842507NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoUpNoNoNoNoNoChYesNO
10176044384717650375628AfricanAmericanFemale[60-70)?1176DM?451253123454384129NaNNaNNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoDownNoNoNoNoNoChYes>30
101761443847548100162476AfricanAmericanMale[70-80)?1373MC?51016000250.132914589NaN>8SteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoDownNoNoNoNoNoChYes>30
10176244384778274694222AfricanAmericanFemale[80-90)?1455MC?333180015602767879NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoYesNO
10176344385414841088789CaucasianMale[70-80)?1171MC?53091003859029613NaNNaNSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoDownNoNoNoNoNoChYesNO
10176444385716631693671CaucasianFemale[80-90)?23710MCSurgery-General452210019962859989NaNNaNNoNoNoNoNoNoSteadyNoNoSteadyNoNoNoNoNoNoNoUpNoNoNoNoNoChYesNO
101765443867222175429310CaucasianMale[70-80)?1176??13330005305307879NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNO